max rank | avg. rank | sentence |
---|---|---|
76 | 41.4000 | I just don't know who you have one of them! |
147 | 58.8462 | I would like them a long, long time or should I take them! |
165 | 73.1429 | I don't think people have these symptoms. |
178 | 54.0909 | I don't have just the right drugs because if you like. |
178 | 89.2222 | My CODEINE has been on the prescription dose, right? |
182 | 67.8889 | So, for people like me can then take them? |
199 | 93.5000 | Do not take any medicine without a prescription. |
200 | 95.3333 | But I'm here to tell their doctors about it. |
200 | 66.3000 | We do not like your doctor should tell you that. |
202 | 96.1111 | As one of those drugs that cause these effects. |
219 | 106.3333 | The study didn't say if any of these doctors. |
223 | 58.9000 | I don't know but, my doctor went out of me. |
225 | 69.9167 | If you put all the time, when people are in less pain). |
226 | 80.7143 | PERIACTIN may take this medication for you. |
231 | 74.2222 | I get more information on him and his work. |
232 | 85.2500 | When you are going to have these symptoms. |
233 | 94.7000 | If so, why did you get your meds for free? |
234 | 72.5000 | And people will go out of their way to help you when you really need it. |
234 | 64.9091 | I really don't think this would be your best for it? |
245 | 81.0000 | If you take these medications in the body. |
246 | 134.0000 | CODEINE doesn't cause symptoms like this. |
248 | 126.5000 | I used information from that site it's great. |
251 | 118.7778 | But now and then two years before the next. |
252 | 85.1667 | I don't have any bad side effects, just not much good effects. |
252 | 75.2727 | What did you think you need to be all that bad. |
254 | 93.3000 | I do when you try to keep the pills down. |
255 | 115.2500 | This medicine should not take the next one 2 to 3 months. |
256 | 73.6667 | You know at least as much as most of the best, or so they say. |
258 | 96.9091 | I would go to sleep better which made my day better. |
264 | 68.6000 | If a FIORICET is going to think about this drug. |
The maximum word rank of a sentence is, by definition, the rank of the rarest word in the sentence. If it is low, every word in the sentence is a high-frequency word. For this reason, the sentences with the lowest maximum word rank are of interest. The table lists these sentences, restricted to those longer than 40 characters.
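The definition can be illustrated with a minimal Python sketch. The function name `sentence_rank_stats`, the whitespace tokenization, and the rank values in `toy_ranks` are assumptions made for illustration only; they are not taken from the corpus. Every token is counted here, whereas the source does not state whether repeated words count once or several times.

```python
def sentence_rank_stats(sentence, rank_of):
    """Return (max rank, average rank) over the known words of a sentence.

    rank_of maps a lowercased word to its frequency rank (1 = most frequent).
    Words missing from rank_of are skipped; None is returned if none is known.
    """
    ranks = [rank_of[w] for w in sentence.lower().split() if w in rank_of]
    if not ranks:
        return None
    # The maximum rank is the rank of the rarest word in the sentence.
    return max(ranks), sum(ranks) / len(ranks)


# Hypothetical ranks, invented for this example.
toy_ranks = {"i": 3, "do": 20, "not": 12, "know": 60}

print(sentence_rank_stats("I do not know", toy_ranks))
```

A sentence gets a low maximum rank only if all of its words are frequent, which is why a single rare word is enough to remove it from the table.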
The overall distribution of the maximum rank across all sentences of the corpus is shown in a diagram with a log-scaled x-axis.
The sentences in the table above are of interest because they are usually easy to understand. The distribution may give insight into the corpus and may provide parameters for language comparison.
While the distribution can be estimated from a small corpus, sentences like those in the table are rare, so a large corpus will give more impressive results.
Table data:
select max(w_id)-100 as m, avg(w_id)-100 as a, s.sentence from sentences s, inv_w i where s.s_id=i.s_id and length(sentence)>40 and i.w_id>100 group by s.s_id order by m limit 30;
Distribution data:
select m, count(*) from (select 100* round((max(w_id)-100)/100) as m from sentences s, inv_w i where s.s_id=i.s_id and i.w_id>100 group by s.s_id) aa group by m;
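The binning done by `100* round((max(w_id)-100)/100)` in the distribution query can be sketched in Python as follows. The helper names `bin_max_rank` and `max_rank_histogram` are hypothetical, the input is assumed to be the maximum rank with the offset of 100 already subtracted, and rounding halves upward is an assumption about how the database's `round` behaves.

```python
from collections import Counter


def bin_max_rank(max_rank, width=100):
    """Round a non-negative maximum rank to the nearest multiple of width.

    Integer arithmetic is used so that halves (e.g. 250) round upward,
    avoiding Python's round(), which rounds halves to the even neighbour.
    """
    return (max_rank + width // 2) // width * width


def max_rank_histogram(max_ranks, width=100):
    """Count how many sentences fall into each bin of maximum rank."""
    return Counter(bin_max_rank(m, width) for m in max_ranks)


# Maximum ranks of the first five sentences in the table above.
print(sorted(max_rank_histogram([76, 147, 165, 178, 178]).items()))
```

Plotting the bin values on a log-scaled x-axis against the counts would give a diagram of the kind described in the text.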
Explain the distribution, especially the increase in its right part.
4.5.2.2 Average word rank in sentence
4.5.2.3 Sentences consisting of many low frequency words I
4.5.2.4 Sentences consisting of many low frequency words II
4.5.2.5 Sentences consisting of short words only I
4.5.2.6 Sentences consisting of short words only II
4.5.2.7 Sentences consisting of long words only I
4.5.2.8 Sentences consisting of long words only II